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Abstract 

We develop a general theory for three states of equilibrium of amyloid peptides: the monomer, 
oligomer, and fibril. We assume that the oligomeric state is a disordered micelle-like collection of a 
few peptide chains held together loosely by hydrophobic interactions into a spherical hydrophobic 



c^— I ' core. We assume that fibrillar amyloid chains are aligned and further stabilized by 'steric zipper' 

o ■ 

^^ ■ interactions - hydrogen bonding and steric packing, in addition to specific hydrophobic sidechain 



contacts. The model makes a broad set of predictions, consistent with experiments: (i) Similar to 
surfactant micellization, amyloid oligomerization should increase with bulk peptide concentration. 



Q ■ (ii) The onset of fibrillization limits the concentration of oligomers in the solution, (iii) The average 

o ! 

fibril length vs. monomer concentration agrees with data on a-synuclein, (iv) Full fibril length 
^ . distributions follow those of a-synuclein, (v) Denaturants should 'melt out' fibrils, and (vi) Added 

^^ \ salt should stabilize fibrils by reducing repulsions between amyloid peptide chains. Interestingly, 

cn ■ 

X/^ ■ small changes in solvent conditions can: (a) tip the equilibrium balance between oligomer and 

L-^ . fibril, and (b) cause large changes in rates, through effects on the transition-state barrier. This 

model may provide useful insights into the physical processes underlying amyloid diseases. 
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I. INTRODUCTION 

What are the forces that stabihze aggregates of amyloid peptide molecules? This question 
is of interest because of the putative role played by amyloid aggregation in diseases such as 
Alzheimer's, Parkinson's, Mad Cow and type II diabetes [1]. Amyloid appears to aggregate 
into at least two different states: amyloid oligomers^ which are small few-chain soluble 
disordered clusters, and fibrils^ which are long many-chain highly structured /3-sheet-like 
aggregates. The view has recently emerged that the oligomers may be the toxic species, 
not the fibrils, as had been expected because of the appearance of plaques in disease [2]. It 
has been challenging to understand the physical principles of amyloid aggregation, in part 
because of a lack of reductionist experimental model systems. In this breach, we believe 
that simple models can help guide and interpret experiments. 

The first challenge is to disentangle how much of amyloid formation can be explained by 
equilibrium vs. kinetics. Often kinetics is easier to study experimentally because measuring 
rates does not require finding conditions of reversibility. Yet, there is experimental evidence 
of multiple stable states: monomers, oligomers, and fibrils, and perhaps a precursor to the 
fibrillar state, called the protofilament [3]. We believe insights can be gained from first 
understanding the underlying phase equilibria. 

II. MODELING THE STATES OF AMYLOID AGGREGATION 

We develop here a model of the equilibrium among the following states: (a) isolated 
monomeric amyloid peptide molecules in solution, (b) few-chain noncovalent aggregates 
(oligomers) of amyloid peptide molecules, (c) the single 'macroscopic thread' called a protofil- 
ament^ which is a noncovalent ordered assembly of many chains, and (d) the fibril, which 
is a bundle of protofilaments. These states are shown schematically in Figs. [1] and [2l (In 
this paper, 'monomer' refers to an individual peptide chain, not to a single amino acid in 
a chain; see Fig. [2^) [3|-l5|. Our interest here is in peptides, such as A/3, a-synuclein, and 
lAPP, that do not have single-chain native folded structures, so our model below neglects 
any possible additional equilibria with a native folded structure. 



A. Modeling the oligomer state 

The ohgomer state is shown in Fig. [2^. For A/3 aggregates, ohgomers range in size from 



trimers to hundreds of monomers 



M. 



We model the amyloid oligomer state as a disordered 
spherical globule having a hydrophobic core containing A^ peptide chains. Each chain has L 
amino acids. We approximate the free energy of oligomerization, AFoUgo, from state A to B 
in Figure [21 in terms of the transfer of the NL amino acids from water into the oligomeric 

core as (y, [Zl] , 

A F 
AFoHgo = -^ = -xNL, (1) 

where kTx is the free energy of transfer per amino acid and x is the Flory-Huggins parameter, 
averaged over the amino acid composition of the peptide and over the solvent accessibilities 



of the various amino acids. Following recent work [8|], we neglect the distinction between 
interior and surface residues that was drawn in older models [y]. 



B. Modeling the protofilament and its nucleus 

The /3-sandwich motif, shown in Fig. [lb, is a common feature of amyloid aggregates 



observed in NMR and X-ray structures 



0ll2 



12| . Here, we assume that the basic structural 



element comprising protofilaments and fibrils is the /3-sandwich. Before describing our fibril 
model, we first define our terminology for structures throughout this paper. A ^-strand is 
a single linear stretch of peptide chain. A f3 -sheet is comprised of two or more hydrogen- 
bonded 13 -strands (see Fig. [lK,b). A /3 -sandwich is two planar /3-sheets face-to- face (see Fig. 
[lb). For example, in the fibrillar state the A/3 molecule is a ^/-shaped /3-hairpin comprised 
of two beta strands. In amyloid fibrils, the /3-sandwich is stabilized by H-bonds parallel to 
the fibril axis and by hydrophobic and van der Waals interactions from the interdigitation 
of side chains within the steric zipper between the two /3-strands (see Fig. [lb) [9]. At a 
given stage of fibrillization, we assume a /3-sheet is composed of h /3-strands. Each /3 strand 
contributes b amino acids to the /3-sheet. The sheet width b must satisfy b < i^ where i < L 
is the length of the /3-strands in the mature fibril. At a given stage of fibril formation, the 
total number of residues in the /3 state is m = 26/i, where the factor of two accounts for the 
two sheets in the /3-sandwich. The quantities b and h are shown schematically in Fig. [2b, d. 
The quantity m serves as an order parameter for the extent of fibril formation. 



III. 



We treat the equilibrium between the ohgomer (state B in Fig. [2j), the fibril nuclei (states 
C and D in Fig. [2j), the protofilament (E in Fig. [2j), and the full fibril (F in Fi g. [21) in a way 
that resembles the standard treatment of the helix-coil transition in peptides fJi 

We call the states BCDE the fibril ordering pathway. The free energy is 

AFBCDE(m) [m 

-X[NL-m)-mlngs- X —In-f, (2) 



kT '^' ' "^ V 2 

as a function of m, which can be regarded as an 'order parameter' or a 'reaction coordinate' 
along the route BCDE. m ranges from m = when the system is fully disordered {i.e., 
fully in state B), to m = NL when the system is fully ordered in the /3-state {i.e., fully in 
state E). (So, in normalized form, a reaction coordinate could be expressed as ^ = m/NL). 
The first term in Eq. [2] is the free energy of converting m of the NL amino acids from 
their oligomeric disordered state, with a corresponding loss of the disordered micelle-like 
hydrophobic interactions. 

The second term in Eq. O —mkT In gs^ is the free energy of forming m pairwise steric 
zipper interactions in the core of the fibril, gg is a dimensionless propagation equilibrium 
coefficient, gg resembles the quantity s in helix-coil theories ll3|, ll6|], except that gg here 
describes (3 structure, not a-helical structure, gg captures various types of interactions, 
including conformational entropy, hydrogen bonds, steric packing, and ordered sidechain 
hydrophobic interactions. A necessary condition for fibril formation is gg > 1. That is, 
fibrils can only form when the sterically zipped state (hydrogen bonds, packing, and ordered 
hydrophobic interactions) is more favorable than the monomeric state. The subscript s 
here in gg indicates an interaction within a single /3-sandwich, not between the different 
/3-sandwiches that make up a full fibril. 

We treat fibrillar ordering as a surface/interior nucleation process. The third term in Eq. 
[21 ^m/21n7, is the free energy of initiating steric zipping at the edge of the /3-sandwich 



(a square having m/2 residues, has a perimeter with \/rn/2 residues). In our model, 7, is 
a surface tension for forming the perimeter bonding. In the metaphor of helix-coil theories, 
7 resembles a, the helix-coil nucleation parameter. Fig. [2] shows that the edge of the /3- 
sandwich has 26 unsatisfied H-bonds, but only b unsatisfied hydrophobic contacts due to the 
stagger between the two sheets. It is these missing hydrophobic and H-bond interactions that 
account for why there should be a barrier, 7 < 1, to nucleating the fibril. This nucleation 
barrier free energy in Eq. [2] is maximal for b = i. At the present stage of knowledge 



of microscopic structures, this square-shape approximation has the advantage of simphcity 
and is adequate to capture the shift in the ohgomer-fibril transition from hydrophobic to 
combined hydrophobic and hydrogen bonding interactions (l8l |. 

This model gives insight into fibril formation rates. We compute the free energy of the 
transition state by finding the maximum value of AForder along the reaction coordinate m 
using Equation [21 The transition state is at AF* = (rfAForder/<^^)m* = 0, so given by 

AAF^ ^ AFo,derK)-AFo,der(0) _ In' 7 .gx 

kT kT 8(ln(^,)-x)' ^' 

Eq. [3] shows that the free energy barrier will be extremely sensitive since the quantity 
ln{gs) — X in the denominator will be small. Ingg and x are dimensionless quantities of 
order unity; their diflFerence is small because the zipping free energy is expected to be only 
slightly more favorable than amorphous hydrophobic interactions. Small variations in gg or 
X, at the level of single amino-acid changes or slight changes in solution conditions, could 
change fibrillization rates by several orders of magnitude il9|. This provides a rationale 
for understanding how a single mutation could transform a normal fibrillization rate, which 
might be too slow to cause disease on a human lifetime, into a much faster fibrillization rate, 
sufficient to cause disease during a human lifetime. 

Our model of the amyloid nucleation process differs from classic nucleation mechanisms in 
two respects: (1) ours involves a one-dimensional line tension, rather than a two-dimensional 

n 

surface tension [20||, and (2) our ordering transition is from oligomers to fibrils, not from 
monomers to aggregates, so our fibrillization mechanism is not driven by increasing the solu- 
tion concentration of monomers. This is consistent with experiments showing that amyloid 
nucleation is concentration-independent. The proposed explanation in the Nucleated Con- 



formational Conversion (NCC) model [21| is that oligomeric chains must enter an activated 
conformation to proceed to fibrils. In our model, the role of activation is played by the 
entropically unfavorable steric-zipper nucleus. 

C. Modeling the full fibril 

In our model, a full fibril consists of p /3-sandwich-motif protofilaments stacked and 
bundled together. Fig. [2^ shows such a fibril for p = 2. AFef is the free energy of bundling 



protofilaments into fibrils, 

^%^ = ^ = nLe = -nL In ( ^] (4) 

where L remains the number of amino acids per peptide chain, n is the number of peptide 
chains in the bundled fibril, g is the propagation constant for forming /3 structure in the 
mature fibril. Eq. [4] gives — In^' = —Ings + e^ so — Ingg is the free energy of the interactions 
within a single protofilament, and e is the interaction energy holding the protofilaments 
together. Because these bundling interactions occur only between a subset of residues on 
the perimeter of the protofilament, we expect that g and gs will be similar. 

Peptides are frequently folded within a fibril such that a single peptide chain contributes 
multiple /3-strands to the fibril. To account for this we introduce the parameters ^, the 
length of each /3-strand, and n^, the number of (3 strands formed by each peptide chain. For 
example, peptides such as A/3 and lAPP that form a single hairpin in the mature fibril have 
Us = 2. The quantity n^, defined such that L = Ugi is shown in Fig. [3l 

From Eq. [2] it is clear that each protofilament will incur a nucleation penalty — kTi In ^ 
and each /3-strand in the fibril will contribute a binding energy —kTllng. However, a more 
convenient quantity is the binding energy per peptide —kTlnglng = —kTLlng. The free 
energy of a fibril consisting of n peptides is then 



kT kT 



= -ln(7^V'^). (5) 



The fibrillization index n must be greater than the minimum fibril size no, however we will 
find that the physical observables are insensitive to the precise value of ng. This is in contrast 
to the oligomer size N which plays an important role in the phase behavior. 

D. The monomer-oligomer-fibril assembly equilibrium 

Now, we combine the free energies above into a grand canonical ensemble to determine 
how the assembly equilibria depend on the concentration of peptide monomers in solution. 
If the oligomeric state resembles a micelle, a reasonable approximation is that the oligomer 
species is dominated by a single aggregation number, with free energy given by Eq. [H 
However, for the fibril, we assume a continuum of aggregation states having number n 
peptide chains and a free energy given by Eq. [51 To compute the properties of the solution, 
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we compute the binding polynomial [2^ 

n 

In Eq. [6l Ci is the concentration of monomers, c^e^^^ is the concentration of ohgomers, and 
the final term is the sum over the concentrations of the fibrils of all possible lengths. The 
total peptide concentration, cq can be computed by using the concentration of each species 
and summing the number of peptides in each species. Thus, Cq is written as 

CO = ^ = c, + iVcf e>^^^ + Y' E nc-,g-\ (7) 

n=no 

where no is the smallest accessible fibril size. 

The solution phase behavior is given by the peptide concentrations in each of three states: 
monomer, Ci; oligomer, CoUgo; and fibril, Cfibrii, where 

Coiigo = Nc^e^'''^ (8) 

and 

oo 

Cfibril = Y' Yl '^^l^"'^ (9) 

n=no 

are the component terms from Eq. [71 The three relative concentration quantities, 
Ci/co, Coiigo/co and Cfibrii/co must sum to one. To compute the phase diagram we numer- 
ically solve Eq. [7] for Ci at fixed values of Cq, g^ 7, and x- The concentrations of peptides 
in the fibril and oligomer states are then computed from Eqs. [8] and M Fig. [5] shows 
the computed phase diagrams. The boundaries in Fig. [5] represent the conditions of equal 
populations of the two corresponding states. In the Methods section we derive analytic 
expressions for the phase boundaries. These are shown by the black lines in Fig. [5l The 
model predictions are given in the following section. 

III. RESULTS 

The model defined in the previous section leads to a free energy landscape with features 
schematically shown in Fig. [H In the following we compute the phase equilibria resulting 
from this landscape. 



A. The model predicts an amyloid triple pointy a 3-state equilibria 

Fig. [5] shows the phase diagram computed from Eq. [3 The x-axis shows the monomer 
concentration. The y-axis shows Ing/x^ the ratio of the free energy for a steric zipper to 
the free energy for amorphous hydrophobic aggregation. The model predicts three main 
features. First, increasing the amyloid peptide concentration in solution leads to increased 
aggregation (both oligomers and fibrils). Second, not surprisingly, at high peptide concen- 
trations, changing solution conditions to favor steric zipping tips the balance from oligomers 
toward fibrils. This phase equilibrium line is relatively fiat, indicating that it is not very 
dependent on monomer concentration. Third, there should be a triple point, a particular 
monomer concentration and solution condition at which monomers, oligomers, and fibrils 
are all present in equal populations. 

The phase diagram can be closely approximated by comparing the critical fibril concen- 
tration (CFC) to the critical oligomer concentration (COC) (see Methods). The lesser of 
these two quantities determines the aggregate species that appears upon raising the peptide 
concentration. However, if the COC is less than the CFC it may be possible to drive the 
solution from the oligomer state to the fibril state by further raising the peptide concentra- 
tion. This transition may be computed using the fibril-oligomer coexistence condition (see 
Eqs. \2M and 1381) . The converse is not true; it is not possible to reach the oligomer phase from 
the fibril phase by increasing the peptide concentration. This asymmetry arises from the 
definitions of the critical concentrations. The CFC is defined by the radius of convergence of 
Eq. [TJ and thus it sets a hard limit on the achievable monomer concentration. On the other 
hand, the monomer concentration will still rise, albeit weakly, upon reaching the COC, and 
therefore it is possible for the monomer concentration to reach the CFC even after oligomers 
have begun to form provided the CFC is not much greater than the COC. 

B. When fibrils are stable, oligomers are not. 

Interestingly, the model predicts a 'sponge-like behavior': under fibril- forming conditions, 
amyloid peptide will be 'soaked up' into the fibrils and depleted from the oligomers. To see 
this, substitute the CFC, Eq. [16l into Eq. [HI to get 

Congo -^iVe-^^(^-^-^). (10) 



This small quantity, e~(^^5'-x) < i^ jg raised to a large power, LN. So, unless x ^^d In^' 
are closely matched, the concentration of the oligomeric state will be negligible under fibril- 
forming conditions. The implication for disease is that if oligomers are toxic, promoting 
fibril formation may deplete the toxins. 

C. Fibril concentration increases as a nonlinear function of monomer concentra- 
tion. 

Fig. [6] compares the theory for how the fibril population, Cfibrii, depends on peptide con- 
centration, to the experiments of Terzi et al. I23l. Since the N-terminal 11-16 residues of 

nn 

A/3 are disordered p^, 12J] in the fibril state, we take L = 26. In order to fit the data we 
convert the experimental concentration cm (in Molar units) to the dimensionless concen- 
trations required in our treatment. We use c = {cm/55'5)M. From that fit, we find that 
— Lln^' = —13.1 and —piln^ = 15.5 at the experimental conditions of 278 K and pH 7.4. 
This is an order of magnitude stronger than the per-residue binding energy within native 



B 



proteins |8|]. 



D. Fibril lengths undergo a 'growth transition' vs. monomer concentration. 

Now we compute the distribution of fibril lengths. The probability P{n) that a fibril has 
a length n is given by 

P{n) = ^^^—-. (11) 

What are the average fibril lengths? In the Methods section we show that the average 
length scales as Cq 7"^^/^, in agreement with the concentration dependence found in ref- 
erence [26]. Fig. [7] shows the predictions of Eq. [26] compared to the average fibril- length 
measurements of van Raaij et al [25]. From the fit we find, g^^ which determines the onset of 
fibrillization and we find 7^^, which determines the fibril length. We obtain — Lln^' = —15.5 
and —£j) In 7 = 32.3. In a-synuclein fibrils, it is found that p = 4, twice the value of A/3 
fibrils [25| . Accounting for this factor of two shows that the values of Ling and ^ In 7 are 
quite similar to those determined for A/3 in the previous section. Ug and i are not yet known 
for a-synuclein. 



Fig. [8] shows the prediction that the fibril lengths follow an exponential distribution; see 
Eq. 



E. Denaturants destabilize the fibrils and oligomers 

What is the eflFect of denaturants and osmolytes on amyloid aggregation? First, because 
both oligomers and fibrils are stabilized by hydrogen bonding and hydrophobic interactions, 
denaturants should 'melt out' amyloid aggregated states. A more subtle question is how 
denaturants shift the oligomer-fibril equilibrium. Fig. [9] shows the model predictions (1) 
that denaturants such as urea, not unexpectedly, should weaken hydrophobic and hydrogen 
bonding interactions, disrupting aggregation, (2) that more denaturant is required to disrupt 
aggregates if the amyloid concentration is high, and (3) that adding denaturant to fibrils 
can drive the system into the oligomer state. 

We note two additional points. First, in apparent contradiction to this prediction, de- 
naturants are sometimes used to promote fibrillization, but that appears to be observed 



exclusively in systems having a native folded state J27l-l29|. unlike the systems we model 
here. Second, the present model resolves a paradoxical result in the literature. Chen and 
Glabe found that urea drove fibrils to 'melt' directly to monomers without passing through 
the oligomer state [4]. In contrast, Kim et al. found that urea drove fibrils to melt to 
oligomers, when then melted to monomers j30|. Our Fig. [9] gives an explanation: the 
A/3 peptide concentrations used by Kim et al. were 3- 10- fold greater than those used by 
Chen and Glabe, shifting to a region of the phase diagram in which oligomers are a stable 
intermediate phase. 

F. Electrostatic repulsion destabilizes the fibrils 

To treat the effects of pH and salt, we express the binding free energy 

- L\iig{q,Cs) = -Llngo + AFes{q,Cs) (12) 

in terms of g^^ which accounts for the binding energy for a reference peptide having zero net 
charge, and an electrostatic component, AFg^, which is the free energy of charging up the 
peptides from their uncharged state to a net charge q in the presence of a salt concentration 
Cg. The latter is given quantitatively by Eq[43l 
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Fig. [To] compares a simple calculation of the electrostatic repulsions with experimental 
measurements of dependence of the critical concentration, which we identify with the theo- 
retical quantity c\ \ on salt concentration. We compute a charge of q = —3.9 at pH 9.0 and 
q = —2.8 at pH 7.4 for the A/3 peptide j3]|. We treat each peptide as a cylinder of radius 
R = lOnm [32] using Poisson-Boltzmann theory; see Eq. HHl Fig. [TOl shows two experiments: 
Klement et al. for pH 9 and Terzi et al. for pH 7.4. The sole fitting parameter in Fig. [TOlis 
—Lingo = —21.9. This corresponds to — Lln^' = —15.5 under the pH 7.4, 5mM conditions 
of Terzi et al. This represents a good agreement with our previous result of — Lln^' = —13.1 
given the extreme sensitivity of the electrostatic free energy at such low salt concentrations. 

IV. CONCLUSION 

We have described a general thermodynamic theory for the aggregation of short peptides 
into globular and fibrillar aggregates. The model predicts two transitions: (1) a micelle- 
like transition of monomeric peptides in solution to an oligomeric state involving a loose 
hydrophobic core and a loss of translational entropy, and (2) a transition from the disordered 
globular oligomeric state to an ordered /3-structured fibrillar state, driven by tighter packing, 
hydrogen bonding, and steric and hydrophobic interactions. 

We find good agreement of the model with experiments on fibril concentrations, average 
fibril lengths, and fibril length distributions vs. monomer concentrations. We find that 
the phase boundaries and transition states are highly sensitive to small changes in solution 
conditions and protein properties. Such sensitivities may be relevant to aggregation processes 
in amyloid diseases. 

V. METHODS 

Critical fibril concentration (CFC) 
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The concentration of peptides in the fibril state is given by the final term in Eq. [7] 

oo 



n=no 

oo 



^ -^ ^ n=no 

(1 - ci^^)2 

The sum in Eq. [13] converges when the argument is less than unity, therefore the monomer 
concentration must satisfy Ci < g~^ if the bulk concentration is to remain finite. This radius 
of convergence defines a concentration 

c[^^ - ^-^ (16) 

which may be interpreted as the CFC for the fibril solution. For Ci much less than this value 
the concentration of monomers in the fibril state Cfn^rii is strongly suppressed by the factor 
Y^{cig^)^^. However, as Ci approaches g~^ the fibril concentration diverges. In this regime 
we can write Cig^ = 1 — 5 and the fibril concentration becomes 

c,..,, - y. (l-"^^)(l;"o^-^) (17) 

which demonstrates that Cfn^rii is insensitive to the lower limit no- 

Note that our definition for the CFC differs from that given in reference [26| . 
Critical oligomer concentration (COC) 

Due to the importance of oligomers in disease progression, we would like to calculate 
the concentration of oligomers both in the presence and absence of fibrils. In the presence 
of fibrils we can approximate the concentration of peptides in the oligomer state, CoHgo^ by 
using the CFC in the second term of Eq. [71 since the monomer concentration varies little in 
the vicinity of the fibril CFC 

CoHgo - iV(7-^^e>^^^. (19) 

This expression is only valid provided e^ < g. If this condition is not satisfied, then Eq. 
[2] is a monotonically increasing function of m meaning the oligomer state has a lower free 
energy per peptide than a fibril of any length. In this case no fibrils will be formed, and the 
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solution will be an equilibrium mixture of oligomers and monomers. We define the COC to 
be the concentration where the oligomer and monomer states have equal occupancies. Using 
the appropriate terms from Eq. [3 we find the COC given by 

(COC) fe-^^^V'^''-'^ 

Fibril-oligomer boundary 

The boundary between the fibril and oligomer phases is defined, for points suitably re- 
moved from the monomer phase, by the condition Cfn^ru = CoUgo — Co/2. Using Eqs. fTSland 
[I9]we have 

^0/2 = 1'' , ''^\,, (21) 

Co/2 = A^cfe^^^. (22) 

Eq. [2T] yields a recursive formula for ci , which to lowest order gives 

Ci^^-^(l-^2yV^) (23) 

which can be combined with Eq. |221 to yield a condition for the phase boundary 

K£ ^ NLlivg 

X ln(^^)-iVln(l-y2yV^)' 

This expression is plotted with long dashes in Fig. [5] 
Average fibril length 

The critical concentrations for fibril and oligomer formation, c^^ and c^^^ are notably 
lacking a dependence on the nucleation parameter 7. While this parameter has little effect 
on the relative stability of the fibril and oligomer phases, we expect that it will play a large 
role in determining the equilibrium lengths of mature fibrils. To see this we consider a system 
that is deep within the regime where fibrils are the dominant species so that Cq — Cfn^ru- 
Using Eq. [18] we find 

ci2.^-^(l- Vt'Vco). (25) 

The jth moment of the fibril length distribution is given by 

(n^) ^ t^}^ (26) 

^ ' ci/-— ^ f^^ . (27) 



l-cig^j \ d{cig^)J \1 - Cig 
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The average length is given by the first moment j = I 

- cy'7-^^/2 + const, (29) 

where Eq. [25] has been used to extract the scaling behavior. 

The lack of a dependence on g in this scaling relation is a result of an approximation 

( f) 
based on cq :» c\ . When this assumption is satisfied the large majority of protein is in the 

fibril state, and the problem of determining the fibril lengths is reduced to a question of fibril 

breakage statistics. Since each breakage incurs a statistical penalty 7^^, the functional form 

of Eq. [29] is not surprising (the factor of two in the exponent is a result of the degeneracy 

of breakage points). For systems near the onset of fibrillization cq — cY the fibril lengths 

depend sensitively on g and Eq. [26] must be used to model the lengths. 

Denaturants 

Denaturants destabilize the folded states of proteins by weakening hydrophobic interac- 
tions and peptide-peptide H-bonds relative to peptide-solvent H-bonds [33| . To capture the 
effect of denaturants on the interaction free energy quantities g and 7, we use 

kTlngc = kT {In g - aoCd) 

kT In jc = kT{lnj + aoCd), (30) 

where gc and 7c are the propagation parameters in the presence of denaturant, q is the 
denaturant concentration and ag is a constant describing the destabilizing effect of the 
osmolyte. This form refiects the fact that the denaturant weakens the H-bonds captured 
in g, but also reduces the fibril end free energy — In 7, which arises largely from unsatisfied 
H-bonds. 

Using Eqs. [HI [20l and [30] we can compute how the critical concentrations will shift as 

( f) 
a function of denaturant concentration. At the onset of fibrillization we have Cq = c\ \ so 

from Eqs. [T6] and [30] we have 

Inci = — Lln^'c (31) 

= -L{lng -aoCd), (32) 

which can be solved for q to give the phase boundary 

Inco + Lln^ 

Cd= 7 . (33) 

Lao 
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Similarly, for the oligomer state we write 

Xc = X - o^iCd (34) 

where ai has been introduced to reflect the fact that since the oligomers are more dependent 
on hydrophobic interactions and less dependent on H-bonds for stability, and therefore, the 
destabilization coeflScient will, in general, be diflFerent. The onset of oligomerization may be 
determined from Eq. [201 

CO = ( ^ j , (35) 

which can be rearranged to yield 

1 / , {N-l)lnco + lnN \ 
^^ = ^(,^ + NL )■ ^''^ 

The flbril-oligomer boundary can be derived from Eqs. [2T] and [221 

lnco/27V = NL{x - ^iq) - NL{lng - ao^), (37) 

where we have dropped the correction term in Eq. [211 In Fig. [9] we take ai = ao/2 reflecting 
our expectation that the hydrophobic interactions stabilizing the oligomer are less affected 
by the presence of denaturant than the H-bonds stabilizing the flbril J8|, |3J|. However, the 
particular choice of ao/2 is for illustration purposes. With this approximation for ai Eq. [371 
becomes 



Cd 



(ln^-NL{x + lng)). (38) 



aoNL 
Based on denaturation studies, we expect that Qq = 0.022M~^ for urea and 0.042 for 

guanidinium [8[]. Using our estimates of Lln^' = 13.1 and ip In j = —15.5 (in the absence 

of urea) for A/3 from our previous analysis of fraction flbril as a function of concentration 

(Fig. [6]) we can predict the flbril fraction as a function of urea using Eq. [30] without a flt 

parameter. Our prediction is compared with the experimental data in Table [H 

Electrostatics 

To compute F^s we approximate the flbril as a smooth cylinder of radius R and uniform 
charge density. The linear charge density may be computed by noting that the average 
charge per /3-strand is q/ris and there are 2p strands per layer in the flbril. 

To determine the peptide charge as a function of pH, we use 

acidic residues -\ cwH—vKa basic residues 



lQpH-pKa^ basic residues ^QpKai-pH 

^^ Z^ ~ 1 _L 1 npH-pKai + 2^ "^1 1 ^npKa^-pH^ V^^^ 



15 





Experiment [30] 


Theory 


[urea] 


Oligomer fraction 


Fibril fraction 


Fibril fraction 


0.4M 





0.37 


0.30 


2 


0.37* 


0.03 


4 


0.2 





10"^ 


6 


0.22 





10-5 



TABLE I: Computed fraction of peptide in the fibril phase compared to the data of Kim et al. 

y. 

* Aggregate fraction was observed to be a combination of fibrils and oligomers. 



where the pKas of the amino acids are taken from Ref. |3l|. So, the charge density on each 
peptide-molecule cylinder is p = 2qp/nsa^ where a = 4.7 A is the spacing between amino 
acids. 



We then solve for the electrostatic potential ip using the Poisson-Boltzmann equation 

(40) 



1 d dip 
r dr dr 



where the ion concentrations are c± = c^e^^^/^-^. In the linearized (Debye-Huckel) approxi- 
mation, Eq. [40] has the solution 

2q 



il){r) = 



-Ko(fi:r), 



(41) 



however, we use the numerical solution of the nonlinear Eq. [40]as the dimensionless potential 
eip/kT can reach values in excess of unity at the low salt concentrations we consider. Here 
K~^ is the Debye length, defined via k^ = 2e^Co/ (e/cBT). At infinite dilution the appropriate 
boundary conditions are ip\R) = —p/27reR and '0((X)) = 0, but for the purposes of the 
numerical solver we employ the outer boundary condition ip\d) = corresponding to a 
solution of fibrils separated by an average distance 2d. We take d = R + 5tv~^ (see Fig. [TTI) . 
For d ^ K~^ the influence of the outer boundary condition will be minimal. 

Once we have computed ^0, we get the electrostatic free energy density of the peptide 



cylinder as 



35| 



/=-(-—) + /cr(c+ \li{Cj^/Cs) + c_ \ii{c_/cs) -Cj^-c_+ 2Cs). 



(42) 
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The first term in Eq. [12] is tlie electrostatic energy stored in the electric field, and the 
remaining terms account for the translational entropy of the ions in the screening layer. We 
then compute the free energy per peptide using 

/*CX) 

AFes = 27r^ / f{r)rdr (43) 

^P J R 

where r is the radial coordinate perpendicular to the axis of the cylinder. 



[1] 

[2] 
[3] 
[4] 
[5] 
[6] 
[7] 
[8] 
[9 

[lo; 
[11 

[12 
[13 
[14 
[15 
[16 

[17 
[18 

[19 

[2o; 

[21 



C. M. Dobson, Nature 426, 884 (2003). 

J. Hardy and D. J. Selkoe, Science 297, 353 (2002). 

S. Chimon et al, Nat. Struc. Mol. Bio. 14, 1157 (2007). 

Y. Chen and C. Glabe, J. Biol. Chem. 281, 24414 (2006). 

C. Glabe, J. Biol. Chem. 283, 29639 (2008). 

K. Dill, Biochemistry 24, 1501 (1985). 

K. Dill and D. Stigter, Adv. Prot. Chem. 46, 59 (1995). 

K. Ghosh and K. Dih, Proc. Nat. Acad. Sci. 106, 10649 (2009). 

R. Nelson et al, Nature 435, 773 (2005). 

T. Luhrs et al, Proc. Nat. Acad. Sci. 102, 17342 (2005). 

M. Sawaya et al, Nature 447, 453 (2007). 

C. Wasmer et al. Science 319, 1523 (2008). 

B. Zimm and J. Bragg, J. Chem. Phys. 31, 526 (1959). 

J. Schellmann, J. Phys. Chem. 62, 1485 (1958). 

S. Lifson and A. Roig, J. Chem. Phys. 34, 1963 (1961). 

D. Poland and H. Scheraga, Theory of Helix-Coil Transistions in Biopolymers (Academic 
Press, New York, 1970). 

K. Ghosh and K. Dih, J. Am. Chem. Soc 131, 2306 (2009). 

S. Auer, F. Meersman, C. Dobson, and M. Vendruscolo, PLOS Comp. Biol. 4, el000222 

(2008). 

J. Jarrett, E. Berger, and P. Lansbury, Biochemistry 32, 4693 (1993). 

J. Zhang and M. Muthukumar, J. Chem. Phys. 130 (2009). 

T. Serio et al. Science 289, 1317 (2000). 

17 



[22] K. Dill and S. Bromberg, Molecular Driving Forces (Garland Science, New York, NY, 2002). 

[23] E. Terzi, G. Holzemann, and J. Seelig, J. Mol. Bio. 252, 633 (1995). 

[24] A. Petkova et ai, Proc. Nat. Acad. Sci. 99, 16742 (2002). 

[25] M. van Raaij, J. van Gestel, I. Segers-Nolten, S. W. de Leeuw, and V. Subramaniam, Biophys. 

J. 95, 4871 (2008). 
[26] C. Lee, Phys. Rev. E 80, 031922 (2009). 
[27] D. Hamada and C. Dobson, Prot. Sci. 11, 2417 (2002). 

[28] A. Ahmad, I. Millett, S. Doniach, U. VN, and A. Fink, J. Biol. Cliem. 279, 14999 (2004). 
[29] S. Wang, Y. Hung, P. Wang, and J. Wu, Korean J. Chem. Eng. 24, 787 (200). 
[30] J. Kim, A. Muresan, K. Lee, and R. Murphy, Prot. Sci. 13, 2888 (2004). 
[31] A. Sillero and J. Ribeiro, Analytical Biochemistry 179, 319 (1989). 
[32] K. Element et al, J. Mol. Biol. 373, 1321 (2007). 

[33] M. Auton, L. M. F. Holthauzen, and D. W. Bolen, Proc. Nat. Acad. Sci. 104, 15317 (2007). 
[34] D. Bolen and G. Rose, Ann. Rev. of Biochem 77, 339 (2008). 
[35] D. Andelman, Proceedings of the Nato ASI & SUSSP on "soft condensed matter physics 

in molecular and cell biology" (Taylor & Francis, New York, 2006), chap. Introduction to 

electrostatics in soft and biological matter, pp. 97-122. 



18 




\ 



/ \ / \ / V /,\ 

/ V / ^ '^'' V 



U 




:2. 

f 




zW^ 




>7>7 









\,v, ^>, ^^ ^^ ,si^ ?:^ r^ 



^ 



^ y ^ > .^^ > >; A > x?^ / 




xxiii'^-jn: 



i 



FIG. 1: Assembly hierarchy of amyloid fibrils shown in atomistic cartoon representation (left) and 
schematically with /3-sheets as blocks (right), a) A single /3 sheet comprised of parallel /3-strands. 
b) /3-sheet observed along fibrillization axis, c) Assembled /3-sandwich (protofilament) consisting 
of two /3-sheets. Note the steric zipper interactions shown as interdigitating side chains (left) and 
as a green layer (right), d) Mature fibril consisting oi^ — 2 protofilaments. 
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FIG. 2: Model of amyloid aggregation equilibria. Each black line indicates the peptide backbone. 
Each red line shows one hydrogen bond. A) Isolated peptide monomers in solution. B) Oligomeric 
assembly of a few peptide chains. C) Nucleus of /3-sheet structure. The peptide backbone runs 
perpendicular to the fiber axis. D) Post-critical nucleus structure having more /3-structure. E) 
Protofilament is a single long thread of /3-structure consisting of a /3 -sandwich^ two /3-sheet planes 
face-to-face. F) Full fibril^ a bundle of protofilament s, shown here containing p — 2 protofilament 
threads. 
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FIG. 3: Schematic representation of the parameter Ug. Here each peptide chain contributes a) 
one, b) two, and c) four /3-strands to the fibril. For clarity, adjacent peptide chains are shown in 
alternating colors. 
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FIG. 4: Schematic representation of the free energy landscape described by our model at low 
(left), intermediate (middle), and high peptide concentration (right). Labels correspond to the 
states shown in Fig. [2l At low concentrations the monomer state (A) is the free energy minimum, 
while at high concentrations the fibril (F) is the minimum. At intermediate concentrations the 
solution is an equilibrium of monomers and oligomers (B). 
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FIG. 5: Phase diagram for peptides with p^Us — 1^ (^ — 15, In^ = 0.6, and ln7 = — 2 as a function 
of peptide concentration and \ng/x- Green =ci/co (monomers). Blue = CoHgo/co) (oligomers). 
Red =Cfibrii/co (fibrils). Lines depict phase boundaries computed from Eq. [241 (long dashes), Eq. 
[2U1 (short dashes), and Eq. [16] (solid). 
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FIG. 6: Plot of Cfihrii as a function of the bulk peptide concentration compared to CD data of 
Terzi et al. for A/3i_4o [22]. L = 26, n^ = 2, j9 = 2, 5 = 1.66 and 7 = 0.54 
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FIG. 7: Average length of fibrils vs. peptide concentration, and compared to experiments on 
Q^-synuclein [25] . 
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FIG. 8: Comparison of the computed fibril length distribution as a function of the bulk peptide 



concentration to the experimental distributions of a-synuclein 



25]. 



24 



5. 

4. - 
2^ 3.- 

2.^1 
1. 





( > monomer 


^■■■■■Ib < ^ oligomer 


^H fibril i fflR^^H 



-6.6 -6.2 -5.^ 



-5.4 



-4.6 -4.2 -3.8 -3.4 



Logio(co) 



FIG. 9: Phase diagram for A^ as a function of peptide concentration and urea concentration. 
The colors represent monomers (green), ohgomers (blue), and fibrils (red). A^ = 4, x^^ — 36.4 
J], and all other parameters are identical to Fig. [6l This diagram explains a discrepancy between 
the experiments of Chen and Glabe, where the black line indicates denaturation. They found no 
intermediate oligomers. For the Kim experiments, denaturation is indicated by the white line; they 
observed oligomeric intermediates. 
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FIG. 10: Predicted solubility of_ 
Data points dX q — 3.9 are from 



A/540 as a function of salt concentration and net peptide charge. 



32l | and the point dX q — 2.8 is from [2 
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FIG. 11: Cylindrical fibril geometry used to solve Eq. HOI Fibrils are taken as cylinders of radius 
R separated by a distance 2d. Red lines denote individual /3-strands (shown here for a fibril with 
p = 2). Horizontal disks show the spacing h between /3-strand layers within the fibril. Eq. HOI 
is solved from surface of the reference fibril (center), out to a distance d (outer cylinder) with 
the boundary condition t/j\d) = reflecting the symmetry of the electric potential between the 
cylinders. 
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